Fluctuations in Mass- Action Equilibrium of Protein Binding Networks 
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We consider two types of fluctuations in the mass-action equilibrium in protein binding networks. 
The first type is driven by relatively slow changes in total concentrations (copy numbers) of inter- 
acting proteins. The second type, to which we refer to as spontaneous, is caused by quickly decaying 
thermodynamic deviations away from the equilibrium of the system. As such they are amenable 
to methods of equilibrium statistical mechanics used in our study. We investigate the effects of 
network connectivity on these fluctuations and compare them to their upper and lower bounds. 
The collective effects are shown to sometimes lead to large power-law distributed amplification of 
spontaneous fluctuations as compared to the expectation for isolated dimers. As a consequence of 
this, the strength of both types of fluctuations is positively correlated with the overall network con- 
nectivity of proteins forming the complex. On the other hand, the relative amplitude of fluctuations 
is negatively correlated with the abundance of the complex. Our general findings are illustrated 
using a real network of protein-protein interactions in bakers yeast with experimentally determined 
protein concentrations. 
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The study of dynamical fluctuations in complex sys- 
tems has emerged as a topic of intense interest germane to 
the fields of biology [l[ , financial systems , traffic in in- 
formation [3[ and transportation |4| networks, and many 
others. Of particular interest is the nature of collective 
effects that arise as a consequence of the connectivity of 
the underlying network. By examining such fluctuations 
we can understand when the underlying network plays an 
important role and when, if possible, it may be ignored. 
A good candidate arena to study dynamical fluctuations 
is that of biomolecular processes taking place in cells. 

Recently, propagation of biological fluctuations has 
been studied in the context of genetic regulation [5J] and 
metabolic pathways [f|. These studies are primarily fo- 
cused on small linear cascades of irreversible interac- 
tions. Conversely, we study the related problem of fluc- 
tuations in the mass-action equilibrium state of densely- 
connected, reversible protein-protein-interaction (PPI) 
networks. These networks, in which proteins (nodes) are 
connected by edges if they bind together, exhibit non- 
trivial topological properties such as clustering and loops 
of various lengths. Ourselves and others have studied the 
effect of large systematic changes in the levels of just one 
or a few proteins on the mass-action equilibrium of their 
PPI networks @, H|. Such changes are likely to occur 
as a consequence of regulated response of the cell to big 
changes in the external environment. For the same sys- 
tem, however, there is another type of perturbation that 
is both different and of significant interest: intracellu- 
lar noise or small fluctuations in equilibrium (bound and 
free) concentrations of many proteins. The randomness, 
smaller magnitude, and sheer number of involved pro- 
teins characterize the difference between the noise and 
fluctuations that are the subject of this study and the 



systematic large changes in the total abundance single 
proteins that we recently studied in Q. These fluctu- 
ations come in two varieties. Spontaneous fluctuations 
in bound concentration occur at constant protein copy 
number, due to the intrinsic stochastic nature of bind- 
ing interactions. These fluctuations are small but change 
rapidly relative to the characteristic time of changes in 
protein copy number. They are well described using the 
machinery of equilibrium statistical physics. In contrast, 
driven fluctuations are induced by changes in protein 
copy number due to the stochastic nature of their pro- 
duction and degradation as well as variation in activity of 
global factors controlling the overall protein abundance. 
This driven noise is usually larger than the spontaneous 
noise. It also happens on timescales (tens of minutes) 
that are large compared to the relaxation time of the 
mass action equilibrium which are rarely slower than sec- 
onds. In fact, the problem of thermal noise is well con- 
nected to the static response of the system to systematic 
concentration changes, as we will show. 

To illustrate general principles with a concrete exam- 
ple, in this study, we used a highly curated genome-wide 
network of PPI in yeast (S. cerevisiae), which, according 
to the BIOGPJD database @, were independently con- 
firmed in at least two publications. We combined this 
network with a genome-wide data set of protein abun- 
dances pJp| . After keeping only the interactions between 
proteins with known concentrations, we were left with 
4,185 binding interactions among 1,740 proteins. The 
same network was previously used by us and others in 
Another simplification (partially justified in these 
earlier studies) is that: 1) we consider only homodimers 
and hcterodimers and thus ignore the formation of higher 
order complexes, 2) we use the same dissociation con- 
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stant Kij — InM (or 34 proteins/cell) for all interactions 
in our network. This choice is justified by a relative lack 
of sensitivity of equilibrium concentrations to details of 
assignment of dissociation constants to individual inter- 
actions (see Fig. 5 in Ref. P). 

The PPI network defines the backbone of a system of 
dynamical dimer formation and dissociation. The sys- 
tem at any instant is described by {Ci}, the set of total 
protein concentrations, {D^}, the concentrations of all 
dimers (ij) and {-Fi}, the set of free protein concentra- 
tions. We will assume reactions are occurring in a unit 
volume, in order to suppress the system volume V in 
the equations that follow. At any instant, the system is 
constrained by mass conservation: 



C i = F i + Y J A 



(1) 



so that Fi is not an independent variable. For consid- 
erations of noise we use deviations of dimer concentra- 
tion, SDij, away from their long term averages. The sec- 
ond moments of these fluctuations, (SDfj) quantify the 
strength of the noise. 

To study spontaneous fluctuations, we consider the 
case where all total concentrations Ci are held constant 
and variations in free and dimer concentration are driven 
solely by thermal fluctuations. To this end, we write the 
partition function for a network of interacting dimers: 
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Z = c X p(-G/k B T) = J2 iVs({Aj})exp(- 

(2) 

where the sum is taken over all possible (integer) copy 
numbers of individual dimers defining the "occupational 
state" {Dij}. The combinatorial factor Ns({Dij}) 
counts the number of microstates of individual labeled 
proteins resulting in a given occupation state {Dij}. For 
example, for a single dimer AB, Ns(Dab) is the combi- 
natorial factor: 
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Using the Stirling's approximation for factorials in 
Ns{{Dij}) one gets a concise expression for the free en- 
ergy for an arbitrary network of dimers: 

G = ! f .)Aj + keTDijllogiDij) - 1]} 

(ij)eE 

N 

+ k B Tj2{F&og(Fi) - 1] - Ci[log(Ci) - 1]} 
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where the first sum runs over all E edges (dimers) and 
the second sum runs over all nodes (proteins) in the 
network. Free concentrations F± in this expression are 
not independent variables but rather a shorthand for 



Ci — £ m flj m . The above expression does not include 
volume-dependent entropy and kinetic terms that we 
have suppressed as they are not relevant to our discus- 
sion here. The first derivative of the free energy with 
respect to dimer concentration gives the Law of Mass 
Action (LMA) that relates free and bound equilibrium 
concentrations in the system via D^ = FiFj/Kij, where 
= K^- Q > exp(—eij/ksT). The second derivative of the 
free energy with respect to dimer concentration yields 
the generalized susceptibility and, in accordance with the 
Fluctuation-Dissipation Theorem (FDT), the noise 
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A direct consequence of this result is that spontaneous 
fluctuations for a dimer linked to the rest of the network 
involve contributions from other dimers, through the in- 
verse of r, the so-called collective effects of the network. 
To address the impact of collective effects on the noise, it 
seems natural to compare the noise of a dimer in the net- 
work to the noise for an isolated dimer (isol-F) with the 
same equilibrium concentrations Fi, Fj, and D^. Such 
an isolated dimer corresponds to a matrix T that is diag- 
onal and has a trivial inverse such that: 
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(6) 



Furthermore,?? > n is easily shown from the con- 
vexity of r [12]. Clearly then, collective effects act to 
amplify thermal fluctuations. This is related to propa- 
gation of static perturbations, studied in as fluctu- 
ations from neighboring dimers contribute to a dimer's 
own noise. We define the amplification factor for a dimer 

m 
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A histogram of amplification factors for the PPI network 
of baker's yeast is examined in Fig. [T] Relative to the 
isolated case, collective amplification can lead to thermal 
noise that is orders of magnitude larger, as is evident 
from this histogram. The distribution has a power-law 
tail with an approximate exponent of /3 ~ 1.5. 

Collective amplification of thermal noise presents a 
worrisome theoretical possibility. Can amplification oc- 
cur without limit? To address this question, it is fruitful 
to develop an alternative formalism in which the mag- 
nitude of fluctuations are calculated directly from the 
partition function. Using Eq. [2] it is straightforward to 
show, by a change of variables, that higher moments of 
D^ can be related to the lower moments evaluated at a 
reduced system size. In particular: 

(A J -(A J '-i)) = (A J -)(A J -)|c i -i,c,-i 
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FIG. 1: Histogram of amplification factors for spontaneous 
(thermal) noise of equilibrium dimer concentrations Dij in the 
PPI network of yeast. Collective effects lead to amplification 
relative to the isolated dimer null model. Large amplification, 
up to several orders of magnitude, is sometimes achieved. 



where the latter moment is evaluated in a system for 
which the copy number of proteins i and j (Cj and Cj) 
are reduced by exactly one. It follows that the noise may 
be alternatively expressed as: 
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The above expression for thermal noise hints at a connec- 
tion to static perturbations of total concentration. This 
connection can be made even more explicit by expanding 
the 2nd term to first order in total concentration: 
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where the matrix: 
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AtJ " dlogF, 
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characterizes the response of equilibrium concentrations 
F m to small static changes in total concentrations Ck Q • 
It should be remarked that, despite the approximation 
used in Eq. [51 this approach is in good agreement with 
the FDT formalism first introduced. One notes that this 
expression for noise explicitly depends only on the to- 
tal and dimer concentrations used to define the matrix 
A. This suggests the definition of a new isolated model 
(isol-C), consisting of an isolated (ij) dimer formed by 
proteins with the same Cj , Cj and . This is only pos- 
sible through changes in the dissociation constant and 
free concentrations of constituent proteins i and j. It is 
important to mention that this model is distinct from the 
isol-F benchmark defined earlier, in which each isolated 
dimer has the same equilibrium free and dimer concen- 
trations (yet different C, and Cj) as the corresponding 



dimer in the network. For an isol-C dimer, the matrix A 
is 2x2 and trivially invertible. The noise is given by: 
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A comparison with the isol-F model reveals that a dimer 
in the isol-C model has an equilibrium free concentra- 
tion Fi — Fi +J2k Dik and similarly for protein j. In 
other words, the contribution of neighboring dimers to 
the noise of dimer (ij) has been included by absorb- 
ing them into an effective free concentration. Moreover, 
where the isol-F model completely ignores the effect of 
neighboring dimers, the isol-C model brings neighboring 
sources of noise one step closer to dimer (ij). Conse- 
quently, the noise of a dimer in the isol-C model always 
exceeds the noise of a corresponding dimer in the real 
network. The real noise for a dimer in a network falls be- 
tween the bounds of the two isolated dimer noise predic- 
tions. A summary of the lower- and upper-bound models 
and their noise is given in Fig. The actual spontaneous 
fluctuations achieved are a result of real network topol- 
ogy and the distribution of total protein concentration. 
It is natural to ask how these fluctuations compare to 
their minimally and maximally achievable values. This 
suggests the coordinate transformation: 



V = C 1 - Ov 
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A histogram of £ for the PPI network of yeast is shown 
in Fig. [3] Of particular note is the large pilcup against 
the upper limit of amplification. In real PPI networks, 
it would seem that collective effects lead to amplification 
quite close the maximally achievable limit. 

Now we turn our attention to the second type of noise 
driven by stochastic changes in protein copy number Cj. 
In the living cell, these fluctuations are typically much 
larger than the spontaneous fluctuations. Furthermore, 
the changes in Cj occur at a relatively slow time scale 
(tens of minutes) so that the mass-action equilibrium re- 
spond to these changes. From the results of @, H[ it fol- 
lows that, in general, the amplitude of driven fluctuations 
is given by: 



= <(£ (A-^rJC, + J2 (A^WCm) 2 ) (11) 



The evaluation of the above expression requires the 
full matrix of cross-correlations (SCkSC m ) which is cur- 
rently experimentally unknown. For the simplest case 
(SCk5C m ) oc C^Smk of uncorrelated fluctuations (so- 
called intrinsic noise [H), the driven response becomes: 
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In conclusion we study how the two types of noise stud- 
ied above relate to simple predictors such as abundance 
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(c) isol-C model 
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rj = diag 
(b) real noise 
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taneous and driven (intrinsic) noise is negatively corre- 
lated with dimer abundance (Spearman coefficient 
of r = —0.98, r = —0.64, respectively). This result is 
generally expected for relative noise amplitudes due to 
the law of large numbers. Indeed, for independent fluc- 
tuations it is expected to decrease as ~ 1/ WlJij ■ This 
explains a particularly strong correlation in the case of 
spontaneous fluctuations. Furthermore, we found that 
relative amplitude of both spontaneous and driven (in- 
trinsic) noise are positively correlated with connectivity 
(r = 0.46, r = 0.34). This is consistent with the overall 
scenario that we investigated above in which any type 
of noise propagates throughout the network and network 
connections (both direct and, to some extent, indirect) 
to noisy partners positively contribute to fluctuations of 
individual dimers. 
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FIG. 2: A comparison of the noise in a network dimer to 
two isolated dimer models defined in the text, (a) The isol-F 
model: Each dimer (ij) is isolated and has the same protein 
free- (Fi, Fj) and dimer- concentrations (Dij) as the corre- 
sponding dimer in the network. This model ignores the con- 
tribution of other dimers to the noise of dimer (ij) (b) The 
noise of a dimer (ij) in the network is given by the (ij), (ij) 
diagonal element of the inverse of the matrix F as described 
in the text, (c) The isol-C model: Each dimer (ij) is isolated 
and has the same protein total- (d, Cj) and dimer- concen- 
trations (Dij) as the corresponding dimer in the network. The 
real noise is bound below and above by the isolated models 

n iBOl ~ F <v< v iBOl ~° 




FIG. 3: Histogram of the spontaneous noise coordinate £ in 
the PPI network of yeast. The coordinate describes the posi- 
tion of noise amplitude relative to its lower (£ = 0) and upper 
(£ = 1) limits described in the text. 



and connectivity (number of connections a dimer has to Work at Brookhaven Nat i nal Laboratory was carried 
the network). With high stati stical s ignificance, we find Qut under Contract No _ DE-AC02-98CH10886, Division 
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